58 research outputs found

Weighted Minimum-Length Rearrangement Scenarios

Author: Chateau Annie
Simonaitis Pijus
Swenson Krister M.
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 19th International Workshop on Algorithms in Bioinformatics (WABI 2019)
Publication date: 01/01/2019

We present the first known model of genome rearrangement with an arbitrary real-valued weight function on the rearrangements. It is based on the dominant model for the mathematical and algorithmic study of genome rearrangement, Double Cut and Join (DCJ). Our objective function is the sum or product of the weights of the DCJs in an evolutionary scenario, and the function can be minimized or maximized. If the likelihood of observing an independent DCJ were estimated based on biological conditions, for example, then this objective function could be the likelihood of observing the independent DCJs together in a scenario. We present an O(n^4)-time dynamic programming algorithm solving the Minimum Cost Parsimonious Scenario (MCPS) problem for co-tailed genomes with n genes (or syntenic blocks). Combining this with our previous work on MCPS yields a polynomial-time algorithm for general genomes. The key theoretical contribution is a novel link between the parsimonious DCJ (or 2-break) scenarios and quadrangulations of a regular polygon. To demonstrate that our algorithm is fast enough to treat biological data, we run it on syntenic blocks constructed for Human paired with Chimpanzee, Gibbon, Mouse, and Chicken. We argue that the Human and Gibbon pair is a particularly interesting model for the study of weighted genome rearrangements.
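The sum-or-product objective described in this abstract can be illustrated with a minimal sketch. It is not the paper's O(n^4) algorithm; it only shows, for made-up per-DCJ likelihoods, that maximizing a product of positive weights ranks scenarios the same way as minimizing a sum of negative logarithms. The function names and the example numbers are assumptions introduced for illustration.

```python
import math

def scenario_score(weights, mode="sum"):
    """Combine per-DCJ weights into a single scenario score (sum or product)."""
    if mode == "sum":
        return sum(weights)
    if mode == "product":
        return math.prod(weights)
    raise ValueError("mode must be 'sum' or 'product'")

def neg_log_cost(likelihoods):
    """Additive cost equivalent to the product objective for positive likelihoods."""
    return sum(-math.log(p) for p in likelihoods)

# Hypothetical likelihoods for the DCJs of two candidate scenarios.
scenario_a = [0.9, 0.5, 0.8]
scenario_b = [0.6, 0.7, 0.9]

likelihood_a = scenario_score(scenario_a, mode="product")
likelihood_b = scenario_score(scenario_b, mode="product")

# The scenario with the larger product has the smaller negative-log cost.
prefers_b_by_product = likelihood_b > likelihood_a
prefers_b_by_cost = neg_log_cost(scenario_b) < neg_log_cost(scenario_a)
```

Under this transformation a likelihood-maximization problem can be handed to a cost-minimizing routine without changing which scenario is preferred.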

HAL-ENS-LYON

Dagstuhl Research Online Publication Server

Ancestral Gene Synteny Reconstruction Improves Extant Species Scaffolding

Author: Anselmetti Yoann
Berry Vincent
Bérard Sèverine
Chateau Annie
Chauve Cedric
Tannier Eric
Publication date: 01/01/2015

We exploit the methodological similarity between ancestral genome reconstruction and extant genome scaffolding. We present a method, called ARt-DeCo, that constructs neighborhood relationships between genes or contigs, in both ancestral and extant genomes, in a phylogenetic context. It is able to handle dozens of complete genomes, including genes with complex histories, by using gene phylogenies reconciled with a species tree, that is, annotated with speciation, duplication and loss events. Reconstructed ancestral or extant synteny comes with a support computed from an exhaustive exploration of the solution space. We compare our method with a previously published one that follows the same goal on a small number of genomes with universal unicopy genes. Then we test it on the whole Ensembl database, by proposing partial ancestral genome structures, as well as a more complete scaffolding for many partially assembled genomes on 69 eukaryote species. We carefully analyze a couple of extant adjacencies proposed by our method, and show that they are indeed real links in the extant genomes that were missing in the current assembly. On a reduced data set of 39 eutherian mammals, we estimate the precision and sensitivity of ARt-DeCo by simulating a fragmentation in some well-assembled genomes and measuring how many adjacencies are recovered. We find a very high precision, while the sensitivity depends on the quality of the data and on the proximity of closely related genomes.
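The precision and sensitivity evaluation mentioned in this abstract can be sketched as follows. This is a generic calculation under the usual definitions (precision = recovered true adjacencies / all predicted; sensitivity = recovered true adjacencies / all true), not ARt-DeCo's own code. The contig names and the helper functions are hypothetical.

```python
def adjacency(a, b):
    """Unordered adjacency between two contigs or genes."""
    return frozenset((a, b))

def precision_sensitivity(predicted, truth):
    """Return (precision, sensitivity) for a set of predicted adjacencies."""
    true_positives = len(predicted & truth)
    precision = true_positives / len(predicted) if predicted else 0.0
    sensitivity = true_positives / len(truth) if truth else 0.0
    return precision, sensitivity

# Hypothetical adjacencies removed by a simulated fragmentation.
truth = {
    adjacency("c1", "c2"),
    adjacency("c2", "c3"),
    adjacency("c3", "c4"),
    adjacency("c4", "c5"),
}

# Hypothetical adjacencies proposed by a scaffolding method.
predicted = {
    adjacency("c2", "c1"),  # same adjacency as (c1, c2), order ignored
    adjacency("c3", "c4"),
    adjacency("c1", "c5"),  # not in the truth set
}

precision, sensitivity = precision_sensitivity(predicted, truth)
```

Storing each adjacency as a `frozenset` makes the comparison independent of the order in which the two extremities are listed.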

Crossref

Springer - Publisher Connector

INRIA a CCSD electronic archive server

Simon Fraser University Institutional Repository

A step towards a reinforcement learning de novo genome assembler

Author: Alves Ronnie
Carvalho Andre
Chateau Annie
Padovani Kleber
Reali Anna
Xavier Roberto
Publication date: 09/06/2021

The use of reinforcement learning has proven to be very promising for solving complex activities without human supervision during the learning process. However, its successful applications are predominantly focused on fictional and entertainment problems, such as games. This work therefore aims to shed light on the application of reinforcement learning to a relevant real-world problem: genome assembly. By expanding the only approach found in the literature that addresses this problem, we carefully explored the aspects of intelligent agent learning, performed by the Q-learning algorithm, to understand its suitability for scenarios whose characteristics are more similar to those faced by real genome projects. The improvements proposed here include changing the previously proposed reward system and adding state space exploration optimization strategies based on dynamic pruning and mutual collaboration with evolutionary computing. These investigations were carried out on 23 new environments with larger inputs than those used previously. All these environments are freely available on the internet so that the scientific community can continue this research. The results suggest consistent performance progress using the proposed improvements; however, they also demonstrate their limitations, especially those related to the high dimensionality of the state and action spaces. Finally, we present the paths that can be followed to tackle genome assembly efficiently in real scenarios, considering recent successful reinforcement learning applications, including deep reinforcement learning, from other domains dealing with high-dimensional inputs.
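For readers unfamiliar with the Q-learning algorithm named in this abstract, the standard tabular update rule can be sketched as below. This is the textbook rule only; the states, actions, reward, and hyperparameters are hypothetical and do not reproduce the environments or reward system of the paper.

```python
from collections import defaultdict

def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """Apply one tabular Q-learning update and return the new Q-value."""
    # Value of the best action available from the next state.
    best_next = max((q[(next_state, a)] for a in actions), default=0.0)
    # Temporal-difference target: immediate reward plus discounted future value.
    td_target = reward + gamma * best_next
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]

# Q-table with a default value of 0.0 for unseen (state, action) pairs.
q = defaultdict(float)

# Hypothetical actions: which read to append next to the partial assembly.
actions = ["read_1", "read_2"]

# Assume a previously learned value for one pair in the next state.
q[("s1", "read_2")] = 1.0

new_value = q_update(q, "s0", "read_1", reward=0.1, next_state="s1", actions=actions)
```

Because the table holds one entry per (state, action) pair, its size grows with the number of reads, which is consistent with the dimensionality limitation the abstract reports.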

arXiv.org e-Print Archive

Comment la reconstruction de génomes ancestraux peut aider à l'assemblage de génomes actuels (How ancestral genome reconstruction can help the assembly of extant genomes)

Author: Anselmetti Yoann
Berry Vincent
Bérard Sèverine
Chateau Annie
Chauve Cedric
Tannier Eric
Publication venue: HAL CCSD
Publication date: 01/01/2016

National audience. The document is an extended abstract…

INRIA a CCSD electronic archive server

An integer linear programming approach for genome scaffolding

Author: Briot Nicolas
Chateau Annie
Coletta Remi
De Givry Simon
Leleux Philippe
Schiex Thomas
Publication venue: HAL CCSD
Publication date: 08/09/2014

This paper presents a simple and fast approach for genome scaffolding, combining constraint modeling and simple graph manipulation. We model the scaffolding problem as an optimization problem on a graph built from the alignment of paired-end reads on contigs, then describe a heuristic that solves this problem by iteratively combining local constraint-solving and cycle-breaking phases. We tested our approach on a benchmark of various genomes, and compared it with several commonly used scaffolders. The proposed method is quick, flexible, and provides results comparable to other scaffolders in terms of quality. In addition, unlike state-of-the-art approaches that require dedicated servers, it can be run on a basic notebook computer even for large genomes.

INRIA a CCSD electronic archive server

HAL Descartes

ProdInra

Hal-Diderot

Aligning the unalignable: bacteriophage whole genome alignments

Author: AE Darling
Anne Bergeron
Annie Chateau
B Kehr
B Morgenstern
BA Galler
C Lee
C Notredame
D Botstein
D Veesler
ER Gansner
F Sievers
G Gremme
GF Hatfull
H Chiapello
J Kahankova
J Kececioglu
JH Grose
Krister M. Swenson
M Hohl
Nicolas Pompidor
Paul Guertin
RC Edgar
RE Tarjan
SR Casjens
Sèverine Bérard
Publication venue: Springer Science and Business Media LLC

Crossref

Computation of Perfect DCJ Rearrangement Scenarios with Linear and Circular Chromosomes

Author: Annie Chateau
Bergeron A.
Bernt M.
Bouvel M.
Bérard S.
Cedric Chauve
Christophe Paul
Eric Tannier
Figeac M.
Habib M.
Hannenhalli S.
Hoberman R.
Mixtacki J.
Sagot M.-F.
Sèverine Bérard
Warren R.
Publication venue: Mary Ann Liebert Inc

Crossref